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ABSTRACT 

This study examined perceptual and articulatory 
confusions among the fricatives /f, v, s, z/ and voiced and unvoiced 
”th” in preschool children« (These phonemes are among the most 
difficult for children to articulate-) Seventeen children from 
3-3-5- 1 years of age were tested on syllables formed by taking all 
combinations of the six fricatives in initial (CV) and final (VC) 
position with one of the vowels /a, i, aI/« Discrimination and 
articulation tests of this syllable set were then given- An 
audiovisual system was used to test discrimination- The fricatives 
were substantially more difficult to discriminate than the control 
items- Two contrasts, /v/ and voiced ”th” and /f/ and unvoiced ”th'’, 
were particularly difficult- There were fewer errors on VC than CV 
pairs- No differences were associated with the vowel, but there was 
some evidence that vowel duration was employed as a cue for 
discrimination of fricatives in final position. While there Was a 
significant correlation between the number of discrimination and 
articulation errors per child, there was little evidence that 
articulation errors on specific phonemes were accompanied by 
discrimination errors on the same phonemes, and no evidence that the 
frequency of occurrence of specific phonemes in children's speech was 
substantially related to discrimination or articulation- (Author/AMM) 
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focuses on contributing to a better understanding of cognitive learning by chil- 
dren and youth and to the improvement of related educational practices. The 
strategy for research and development is comprehensive. It includes basic re- 
search to generate new knowledge about the conditions and processes of learn- 
ing and about the processes of instruction, and the subsequent development of 
research-based instructional materials, many of which are designed for use by 
teachers and others for use by students, These materials are tested and refined 
in school settings. Throughout these operations behavioral scientists, curriculum 
experts, academic scholars, and school people interact, insuring that the results 
of Center activities are based soundly on knowledge of subject matter and cog- 
nitive learning and that they are applied to the improvement of educational practice. 
This Technical Report is from the Language Concepts and Cognitive Skills 
Related to the Acquisition of Literacy Project in Program 1, General objectives 
of the Program are to generate new knowledge about concept learning and cogni- 
tive skills, to synthesize existing knowledge, and to develop educational ma- 
terials suggested by the prior activities. Contributing to these Program objec- 
tives, this project's basic goal is to determine the processes by which children 
aged four to seven learn to read and to identify the specific reasons why many 
children fail to acquire this ability. Later studies will be conducted to find 
experimental techniques and tests for optimizing the acquisition of skills needed 
for learning to read. 
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ABSTRACT 



This study examined perceptual and articulatory confusions among the 
fricatives /f, v, s, z, 0, 8/ in preschool children. These phonemes were se- 
lected because they are among the most difficult for children to articulate. 
Seventeen children between 3,3 and 5,1 years old were tested on syllables 
formed by taking all combinations of the 6 fricatives in initial (CV) and final 
(VC) position with one of the vowels /a, i, al/, Discrimination and articu- 
lation tests of this syllable set were then administered, An audiovisual 
system was used to test discrimination by the following procedure. The child 
heard a syllable (e.g,, ”/fa/") over earphones, and simultaneously saw a 
brightly colored animal drawing on the left side of the visual display. Next, 
a second syllable (e.g,, "/va/") was heard, and a second animal appeared to 
the right of the display. Then both animals were shown and the child was 
asked to identify one of the syllables (e.g,, "Who said /va/") by pressing a 
glass panel over the appropriate animal. Each fricative pair differed only in the 
fricative phoneme (Q.g., /fa— va/), vowel and position being identical. Control 
pairs differing on several phonetic features were interspersed throughout the 
test series (e.g., /a8— ki/). In the articulation test, the child repeated each 
syllable after the experimenter. 

The fricatives were substantially more difficult to discriminate than the 
control items (error rates of ^8% and 13%, respectively). Two contrasts, /v-8/ 
and /f-0/. were particularly difficult. There were fewer errors on VC than CV 
pairs. N' differences were associated with the vowel, but there was some evi- 
dence that vowel duration was employed as a cue for discrimination of fricatives 
in final position. While there was a significant correlation between the number 
of discrimination end articulation errors per child, there was little evidence that 
articulation errors on specific phonemes were accompanied by discrimination 
errors on the same phonemes. Discrimination and articulation errors both re- 
sembled the pattern of errors found in tests of phoneme perception in adults. 
There was no evidence that the frequency of occurrence of specific phonemes 
in children's speech was substantially related to discrimination or articulation. 
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INTRODUCTION 



While considerable information has been 
collected over the past 20 years on the acous- 
tic cues which adults use to distinguish one 
phoneme from another, there has been rela- 
tively little research on phoneme discrimina- 
tion by children. One possible reason for the 
scarcity of such studies is the difficulty of 
testing young children. In the present study, 
a speech discrimination test designed for pre- 
school children was used to determine those 
features of tlie speech signal by which they 
made phonemic discriminations. The study 
concentrated on fricatives, since available 
evidence suggested that these constitute the 
major source of articulation and discrimina- 
tion problems in kindergarten and first-grade 
children (Templin, 1957; Bricker, 1967). 

Research at Haskins Laboratories has 
identified two features of the acoustic signal 
which allow discrimination among fricatives 
preceding vowels (Harris, 1958; Delattre, 
Liberman k Cooper, 1963). The frequency 
range of tlie friction, i.e., the initial burst of 
noise, is of primary importance. For/s/ and 
/z/ the friction is in the range above 3500 cps, 
while for/§/ and /z/, the lower limit is 2000 
cps. For the contrasting pairs/ f-v/ and 
/0~ts/, friction extends in both cases from 
1000 cps up. Hence, these pairs are differ- 
entiated solely on the basis of the transitional 
portion of the signal, which proved to be rela- 
tively weak. Other studies using natural stim- 
uli have shown /f/ and /o/ are most frequently 
confused in normal speech, followed by /v/ 
and /is/ (Strevens, I960; Hughes & Halle, 

K-56). 

Duration of both the consonant and vowel 
may constitute significant cues in identifica- 
tion of fricatives which, unlike stops and 
plosives, can be sustained. Miller and Nicely 
(1955) suggest that length may be important 
in the isolation of/s/, /z/, /s/, and/z/. 
Parmenter and Trevins (1935) showed sizeable 
differences in articulatory duration of selected 
fricatives (e.g.,/z/, 110 ms; /s/, 100 ms; 

/f/, 91 ms; /v/, 63 ms; /is/, 61 ms). Peterson 



and Lehiste (I960) found that average vowel 
duration was longer by 30 to 100 ms before 
final voiced fricatives than before voiceless 
fricatives. Vowel duration was not affected by 
the initial consonant. The cue of duration 
should ■'■herefore tend to facilitate VC discrim- 
ination, but not CV discrimination.* 

The significance of durational cues is 
further indicated by Denes' (1955) study on 
the effect of duration on the perception of 
voicing for/s/ and /z/ in final position. He 
asked his subjects to report either the verb 
or noun form of the word use. When the word 
ended in /s/ but had the durational charac- 
teristics of/z/, the sound was heard as /z/. 

When the word ended in/z/ but had the dura- 
tional characteristics of/s/, the final sound 
was perceived as /s/. Subsequent investiga- 
tion indicated that perception of the consonant 
as voiced increased as the ratio of the con- 
sonant to vowel duration decreased. 

There is some evidence that interaction 
between the consonant and the adjacent vowel 
may influence perception of consonants. How- 
ever, the effect is most noticeable for the back 
(velar) consonants (Fletcher, 1953; Licklider & 
Miller, 1951), and hence it is of relatively little 
consequence for fricatives. Position of the con- 
sonant, preceding or following the vowel, also 
appears to affect both the intensity and discrim- 
inability of the consonant, but the data are scanty 
and puzzling. Thus, Black ( 1950) found con- 
sistently greater pressure in production of initial 
consonants, while Tolhurst (1949) found that 
voiceless consonants were slightly easier to 
recognize in final position than voiced ones. 

The purpose of the study reported here 
was to test the ability of preschool children to 
discriminate among the fricatives /f/, /v/, /o/, 
/is/, /s/, and /z/ in both CV and VC pairs. The 
experiment also permitted evaluation of the re- 
lationship between discrimination and articula- 
tion performance on this set of fricatives. 



^Consonant is abbreviated C, vowel by V. 
Hence CV means consonant-vowel as in /fa/. 
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METHOD 



MATERIALS 

Pairs of syllables were prepared so that 
each of 6 consonants, /f/, /v/, /0/, /s/, /s/, 
and /z/, was paired with each of the remain- 
ing consonants, yielding 15 pairs. Each pair 
was tested 6 times, once with each of the 3 
vowels /a/, /i/, and /al/ in VC or CV order. 

In other words, the vowel and order were 
fixed for each of the two syllables in any test 
pairing so that /fa—va/ was tested, but not 
/fa— vi/ or /fa-av/. 

In addition to the 90 syllable pairs de- 
scribed above, 7 maximally distinguishable 
control pairs were selected. In each control 
pair, a different vowel was used in each 
syllable; VC order was used in one syllable 
and CV in the other; and the consonants were 
from different manner classes (e.g., a frica- 
tive and a plosive). An example of a control 
pair is /a6-ki/. By mistake, the vowel was 
not changed in one of the control pairs, /az- 
ta/. 

Recording of the stimulus materials was 
done in a radio studio, using a Telefunkin 
ELA-M-251E condenser microphone. Record- 
ing was with an Ampex 351 recorder using 
Scotch 311 Tenzar tape at 15 ips. The speaker 
was a male graduate student majoring in lin- 
guistics. His dialect most closely approxi- 
mated Upper Midwest (Wise, 1957). 

To determine the quality of the recording 
and the difficulty of the task for mature 
speakers, 6 parents of the preschoolers were 
administered the same tests as the children 
received. There were two errors out of 444 
opportunities, an error rate of .5%. 

The visual stimuli consisted of 48 dis- 
tinctive "pseudo-Seuss " figures of animals 
and common objects printed in bright, two- 
color combinations. 

PROCEDURE 

Each trial consisted of sequences of 
auditory and visual events. Auditory stimuli 



were presented by a Uher 5000 recorder through 
Telex HFR-91 earphones. The original record- 
ing was re-recorded at 3.75 ips, which was 
the fastest speed on the Uher. An MTA 400 
Scholar served for display of the visual stim- 
uli. This device is essentially an elaborate 
memory drum. The display area was covered 
by two one-way vision panels, 3.5 x 5 in., to 
the right and left sides, light-shielded to form 
two separate display areas. Either of the two 
displays could be viewed when the correspond- 
ing panel was back illuminated. The child 
responded by pressing one of the display 
panels, which were mounted on sensitive 
micro-switches. 

A typical trial might follow this sequence, 
as an example. The left picture was illumi- 
nated on the MTA for 2 sec., and the child 
simultaneously heard /fa/. Then the left pic- 
ture was turned off; the right picture was 
turned on for 2 sec.; and /va/ was presented. 
Next both pictures were illuminated, and the 
child heard "Who said /fa/?" The child was 
allowed to take as long as he v^^ished to re- 
spond. A lockout system prevented responses 
except during the response period. If the re- 
sponse was correct (in this case, if the left 
panel was pressed), a green light at the top 
of the MTA was turned on; the right stimulus 
was turned off; and the child heard "I said 
/fa/." If the response was incorrect, a red 
light was turned on, and after a 2-sec. delay 
the child was allowed to correct his error, 
after which the events proceeded as after a 
correct response. During the 4-sec. intertrial 
interval, a new pair of pictures moved into 
the display area. 

Testing was conducted over a period of 
seven weeks, each child being tested once 
per week. During the first week, a pretrain- 
ing test of 10 pairs was given. The first 7 
pairs were pictures of real animals accompanied 
by the sounds of those animals as mimicked by 
a human voice. The last 3 pairs consisted of 
pictures and nonsense syllables similar to 
those used in the experimental sessions. 
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During each of the six experimental ses- 
sions which followed, 15 fricative pairs and 
7 control pairs were tested. During each ses- 
sion, each of the 15 basic fricative contrasts 
was tested once. The vowel remained constan't 
throughout a session, and roughly the same 
number of CV and VC orderings appeared (7 of 
one, 8 of the other). 

A single random order of each of the six 
lists was prepared (cf. Appendix). The first 
two pairs were always control items. Order 
of presentation of the six lists was counter- 
balanced over subjects. 

During both of the last two sessions, an 
18-item articulation test was given following 
the discrimination test. The 36 experimental 
syllables were assigned randomly to one of 
two test lists of 18 syllables each. The child 
was asked to repeat each item after the ex- 
perimenter said it. The child's responses 
were transcribed by a second experimenter and 
were also recorded (Uher 5000 recorder, Uni- 
dyne-Shure 545L lavalier microphone). 

At the first session, each child was given 
the following instructions: 

We are going to play a game which has 
pictures and also sounds. You will see 
some pictures of animals on this machine 
[indicating the MTA] and you will hear a 
man say the sounds that these animals 
make through these earphones. After you 
have seen two pictures the man will ask 



you a question and you can answer him 
by pressing the glass over the correct 
picture. Let's try it. [Experimenter demon- 
strates.] After a while you will see some 
funny pictures and you will hear a man 
say a sound for each picture. You will 
have to listen very carefully and answer 
the man's question in the same way as 
before. 

The experimenter reinforced correct re- 
sponses by comments such as "good" or 
"fine." Comments such as "listen carefully" 
were used for incorrect responses. Subjects 
were also rewarded at the end of all but the 
first session with candy or a small toy. 

Each session lasted approximately 10 
minutes. Testing was conducted in a mobile 
trailer which was relatively quiet but not 
totally soundproof. 



SUBJECTS 

Seventeen children between 3 years 4 
months and 5 years 1 month of age were re- 
cruited during the summer of 1965 from a 
graduate-student housing area at the Univer- 
sity of Wisconsin, Madison. There were 10 
boys and 7 girls. Some of the subjects had 
attended nursery school, but none had gone to 
kindergarten. 
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RESULTS 



DISCRIMINATION TEST 

Seven control pairs were tested on each 
of the six sessions. Mean errors ranged from 
10% to 18% for these items over sessions, 
with a mean of 13%, There was no systematic 
change in error rate over sessions. The num- 
ber of errors per subject ranged from 1 to 16 
(maximum possible 42), Since the control 
items were maximally discriminable synthetic 
syllables, it can be argued that the discrimina- 
tion task was difficult, but not beyond the 
children's capacity, and that the pretraining 
was adequate. Performance improved with 
age; the eight children in the sample who 
were 4 years old or younger had an error rate 
of 18%, the nine older children, an error rate 
of 9%, 

The overall mean error rate on the frica- 
tive pairs was 28%, As expected, these con- 
trasts proved substantially more difficult for 
the children. There was no consistent trend 
in the error rates for these items over ses- 
sions, The number of errors per child ranged 
from 14 to 3 8 (maximum possible 90), There 
was a relatively high correlation between the 
number of errors on the control pairs and the 
fricative pairs (r - ,55, p < ,01), 

Two analyses of variance were carried 
out to determine the effects of variation in 
vowel, position of the consonant in the syl- 
lable, and specific pair, (Recall there were 
three vowels and 15 pairings of the six frica- 
tives,) For analysis of the vowel effect, error 
scores were summed over the position vari- 
able, yielding for each subject an error score 
of 0, 1, or 2 for every vowel-pair combina- 
tion, There were no differences in performance 
associated with the three vowels {F < 1), nor 
was the vowel-by-pair interaction statistically 
reliable, iP(28,448) = 1,49, p< ,10, There 
were differences between pairs, to be dis- 
cussed below. 

In the second analysis, the error scores 
were summed over the vowel variable, yield- 
ing a score of 0—3 for each position-pair 



combination. There were significantly fewer 
discrimination errors when the consonant fol- 
lowed the vowel (24% errors) than when it 
preceded the vowel (31% errors) ,jP( 1 , 16) = 
9,35, p< ,01. The interaction between posi- 
tion and pairs was also statistically signifi- 
cant, jP(14,224) = 1,79, p< ,05, Investigation 
of specific combinations (Table 1) showed 
that the pairs /s-s/ and /f— v/ were rela- 
tively much easier as VC than CV, whereas 
/v— z/ and /f— s/ were much easier as CV than 
VC, contrary to the overall trend. 

Differences between the mean errors per 
pair in the lowest row of Table 1 were also 
statistically significant, F(14,224) = 6,13, 
p < ,01, The results of a Duncan range test 
of these means are also indicated in Table 1, 
Discrimination of pairs /v— S/ and /f-O/ was 
at the chance level, while /O-z/ and /s-z/ 
tended to be easier than the other pairs. 



ARTICULATION TEST 

Each child pronounced 3 6 fricative syl- 
lables, all combinations of 6 fricative phonemes 

3 vowels, and CV or VC, Aside from fricative 
substitutions, there were only 4 other errors: 
one child added a final /-b/ to /sa/ and /za/, 
there was one vowel substitution, and in one 
instance /h-/ was placed before /av/. These 

4 errors were disregarded in the analysis of 
the articulation data, the summary results of 
which are shown in Table 2. The number of 
articulation errors ranged from 2 to 16 per 
subject, with a mean error rate of 21%, There 
were 75 CV errors and 56 VC errors; just as 

in the discrimination test, CV items were more 
difficult than VC, The correlation between the 
number of errors by a child on the articulation 
test and the fricative discrimination test was 
substantial (r = .62, /)< ,01). The number of 
errors on the articulation test was not as 
closely relate(^ to discrimination of the con- 
trol pairs (r = AQ, p< .05), It is very pos- 
sible that these correlations are the result 
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of age differences, as reported by Templin 
(1957). 

No statistical evaluation of the data in 
Table 2 was attempted, but certain trends are 
apparent. First, there were no errors in man- 
ner of articulation; all of the responses given 
were from the subset of fricatives used in the 
study. Second, the pattern of substitutions is 
quite similar for initial and final consonant 
positions. The number of errors involving 
only a change in place of articulation is about 
the same for both positions (45 in initial, 49 
in final). There were more errors in voicing 
for initial than final positions (17 and 2, re- 
spectively) , and more errors in both voicing 



and position for initial than final positions 
(13 and 5, respectively). Of the 131 errors, 

92 preserve the voicing feature. Third, when 
a substitution involves a shift in place of 
articulation, the shift is generally small. The 
majority (59%) of the substitutions come about 
when an interdental fricative (/©/ or /is/) is 
replaced by a labiodental fricative (/f/ or /v/). 
Next most common (19%) is replacement of an 
alveolar fricative (/s/ or /z/) by an inter- 
dental fricative. There are only two occur- 
rences of an interchange between labiodental 
and alveolar positions, the largest possible 
change in place of articulation for these ma- 
terials . 



Table 1. Mean Probability of an Error in the Phoneme Discrimination Task, 
Averaged Over Vowels 



Pair 


0 


s 


f 


is 


V 


f 

6 


f 


e 


e 


s 


e 

s 


S 


f 


6 


f 

e 




z 


z 


z 


s 


z 


s 


s 


V 


V 


z 


V 


V 


CV 


18 


18 


20 


33 


16 


26 


22 


30 


30 


33 


39 


37 


51 


45 


55 


VC 


8 


10 


20 


8 


30 


20 


28 


22 


24 


26 


24 


34 


21 


52 


51 


Mean 


13 

• 


14 


20 


21 


23 


23 


25 


26 


27 


30 


32 


35 


36 


49 


53 


Range 

Test 

































Note. — Any two entries in the Mean row not underlined by the same line are significantly 
different,/* < .05, by a Duncan range test. Decimals have been omitted. 



Table 2. Frequency of Fricative Substitutions in Articulation Test 



Stimulus 






Substitution 






f 


V 


6 is 


s 


z 





f 


— 


-- 


1, 


0 


1, 


3 


0, 


0 


0, 


0 


0, 


0 


V 


4, 


1 


-• 


— 


0, 


1 


0, 


0 


0, 


0 


0, 


0 


e 


19, 


11 


0, 


0 


-- 


-- 


1, 


0 


1, 


2 


0, 


0 


is 


7, 


2 


13, 


25 


3, 


1 


-• 


-- 


0, 


0 


0, 


0 


s 


1, 


0 


0, 


0 


6, 


4 


0, 


0 


-■ 


— 


1, 


0 


z 


0, 


0 


0, 


1 


6, 


2 


3, 


2 


7, 


0 







Note. — First entry in each cell is for CV position, second entry is for VC. 
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IV 

DISCUSSION 



The major point to be made about these 
data is that they follow the pattern of con- 
fusions found in speech perception by adults. 
In other words, if one asks, What pairs of 
fricatives are difficult for young children to 
articulate or discriminate? the answer is. 
Those pairs which adults most often confuse. 
The classic study of Miller and Nicely (1955) 
serves well as a reference. Adults were pre- 
sented with CV items in a noise background 
and asked to identify' each consonant. The 
vowel was held constant throughout as /a/. 
They described their results by a model based 
on three articulatory features — manner and 
place of articulation, and voicing,^ A similar 
analysis gives considerable order to the 
present data. In Table 3, proportion of dis- 
crimination errors for various pairs are 



^Confusions among the stimuli tended to 
preserve certain features more than others. 

In particular, voicing was least likely to be 
lost. Thus, for the stimulus /ga/, the most 
common substitutions were /da/, and /za/, 
while /ka/ was relatively uncommon. For 
/da/ , place of articulation is wrong but man- 
ner and voicing correct, whereas for /za/, 
manner is incorrect, place and voicing correct. 
Menyuk (1968) has demonstrated the applica- 
tion of a feature-analysis model to articulation 
errors in young children. Also of, Jakobson, 
Fant, and Halle, 1963, 



arranged according to place and voicing char- 
acteristics, This analysis indicates that the 
highest error rates occurred when neither ar- 
ticulatory dimension provided a strong cue 
(i.e,, when there was identical voice charac- 
ter and the place difference was small). When 
either place or voicing provided a suitable 
cue for differentiation, error rates were sub- 
stantially lower. There were, however, no 
large differences associated with the size of 
the place difference when voicing was differ- 
ent, nor was the voicing cue more salient than 
the place cue. Moreover, the combination of 
a voicing difference with a large place dif- 
ference did not make it easier to distinguish 
a pair. The relations between articulation and 
discrimination data from the present study, 
the Miller-Nicely confusion matrix based on 
adults, and the freguency of occurrence of 
the phonemes in the speech of young children 
are examined by the analysis presented in 
Table 4. The first entry in each cell of the 
half-matrix is the rank-order difficulty of 
discriminating the pair in the present study. 

The second entry is the rank-order difficulty 
of articulation in the present study. The con- 
fusion matrices in Table 2 were collapsed 
over CV and VC, and then the half-matrix was 
formed by "folding" on the main diagonal. 
Visual examination of Table 2 suggests that 
collapsing over VC and CV does no injustice 
to the data. In the half-matrix, certain asym- 
metries in the tables are disregarded, but 



Table 3. Proportion of Errors in Discrimination Task as a Function 
of Voicing and Place-of-Articulation Features 







Place of Articulation 


Voicing 


Same 


Small Difference Large Difference 


Same 


— 


.41 ,24 


Different 


,27 


.21 ,25 
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these are relatively unimportant for purposes 
of comparing articulation and discrimination 
data.^ 

The third entry is from Miller and Nicely 
( 19555 Table III, frequency response 200- 
6500 cps, S/N = -6 db), where again a half- 
matrix was formed by folding a full matrix. 

The lowest entiy is based on data from Poole's 
(1934) study of the frequency of occurrence of 
consonants in the free speech of preschool 
and primary children/ She found a wide range 
in the frequency per hundred words of the 
fricatives used in the present study (/s/, 12,5; 
/■fi/, 9.5; /z/, 7.0; /v/, 4.0; /f/, 3.5; /O/, 1.5). 
To obtain a measure based on her count that 
could be compared with our data, for each pair 
in Table 4 the total frequency of occurrence of 



^In the articulation data in Table 2, /f/ 
and /v/ are frequently substituted for /O/ and 
/S/, respectively. In fact, these two substi- 
tutions account for 53% of all the errors in 
Table 2. Considering only the fricative sub- 
set in Bricker' s ( 1967) study of articulation 
of CV syllables (V = /a/) by children from 3 
to 6 years of age, these two substitutions 
constituted 13 to 25% of the errors depending 
on the age group. Substitutions in the opposite 
direction (/0/ for /f/, /s/ for /v/) were much 
less common, 5% in our study, 2 to 5% in 
Bricker' s. This asymmetry may be the result 
of response bias: /f/ is more common in 
children's speech than /©/ according to both 
Poole (1934) and Carterette and Jones (1965), 
On the other hand, /v/ is relatively infrequent, 
although /s/ occurs with high frequency in 
part because of its appearance in initial 
position in the closed set of function words. 

In the data of Miller and Nicely ( 1955), the 
probability of replacing an interdental by a 
labiodental is about double the probability 
of the reverse substitution. These asymmetries 
suggest that further work is needed with dis- 
crimination tasks where response bias is 
more adequately controlled. The presence of 
such asymmetries in the perceptual process- 
ing of the speech signal would seem to raise 
problems for simple feature analysis models, 
'^Poole's phoneme count agrees well with 
the more recent data of Carterette and Jones 
(1965) on first-grade children. If the counts 
are normalized on totals over the six frica- 
tives used in study, the proportions for each 
fricative from Poole and Carterette-Jones, 
respectively, are: /s/, ,328, .341; /S/, .250, 
.212; /z/, ,184, .209; /v/, .105, .078; /f/, 

.092, ,105; /O/. .039, .051 . 



the two elements of the pair was computed, 
and then the set of pairs was rank ordered on 
this total. Thus, the pair/O-f/ had a total 
frequency of 5.0 and was assigned a rank 
order of 1, being least frequent. 

Rank-order correlations were then calcu- 
lated for the various combinations of the four 
measures in Table 4. Correlation between the 
rank-order difficulty of articulation and dis- 
crimination in the present study was mar- 
ginally significant (p « .34, p< .10). Corre- 
lation of discrimination and articulation by 
the preschoolers with the Miller-Nicely data 
was reasonably high: ,59 for discrimination, 
,62 for articulation,/) < .01 in both cases. 



Table 4, Half-Matrix of Rank Order of 

(1) Discrimination and 

(2) Articulation Data from Present 
Study, (3) Miller-Nicely Confusion 
Data from Adult Subjects, and 

(4) Poole Count of Frequency of 
Occurrence in Preschool and 
Primary Grade Children 



First 
Member 
of Pair 


Data 

Set 


Second Member of Pair 


V 


0 


S 


s 


z 


f 


1 


3 


1 


10.5 


9 


13 




2 


7 


2 


4 


11 


14 




3 


8.5 


1 


7 


4 


14 




4 


3 


1 


8 


11 


5 


V 


1 




7 


2 


6 


10.5 




2 




11 


1 


14 


11 




3 




10 


2 


13 


6 




4 




2 


9 


12.5 


6.5 


0 


1 






5 


8 


15 




2 






8.5 


3 


5.5 




3 






11 


3 


12 




4 






6.5 


10 


4 


» 


1 








12 


4 




2 








14 


8.5 




3 








15 


5 




4 








15 


12.5 


s 


1 










14 




2 










5.5 




3 










8.5 




4 










14 



Note. — See text for more detailed descrip- 
tion of how rank orders were computed. For 
each data set. Rank 1 indicates highest error 
rate (or lowest frequency). Rank 15 lowest 
error rate. 
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None of the other correlations approached 
significance. In particular, correlations be- 
tween the frequency index and other perform- 
ance measures Vv^ere uniformly low (p < .25). 
Inspection of Table 4 shows that the frequency 
predictor is off in several comparisons, Neither 
/0— v/ nor /0-z/ were as difficult for the 
children to discriminate as the frequency 
measure would predict. Also, /ts—v/ is a very 
difficult pair, although both phonemes are 
common in children's speech. Where the fre- 
quency predictions are in closest agreement 
with the data (e.g., /s-S/ is easy and /0-f/ 
difficult), the predictions are similar to those 
of a feature analysis. 

In short, frequency of occurrence (at 
least, the gross measure obtained by Poole) 
is not a reliable indicator of the ease with 
which young children discriminate or articu- 
late fricative consonants. On the other hand, 
there is a reasonably close relationship be- 
tween both articulation and discrimination 
performance in young children and the per- 
formance of college students in a perceptual 
recognition test. 

It is pertinent at this point to consider 
the relation between articulation and dis- 
crimination. A child is unlikely to produce 
in a consistent fashion contrasts that he can- 
not hearj on the other hand, not all contrasts 
that can be heard need be produced in speak- 
ing. The semivowels /r/ and /!/ are frequently 
misarticulated, but there is no good evidence 
that discrimination of these phonemes is par- 
ticularly difficult.® 

Our discrimination data agree closely 
with predictions based on the acoustic prop- 
erties of the various fricatives. The frequency 
range of the friction is sufficient for identifi- 
cation of /s/ and /z/. This cue does not per- 
mit discrimination between the interdentals 
/0— s/ and labiodentals /f-v/ for which the 
weaker, transitional cues must be used. 
Averaging over all contrasts, the mean error 
rate in discrimination of /s/ is 23%, and of 
/z/, 21%. For the other four phonemes, the 
mean error rate ranges from 30% to 32%. 



®For some reason, discrimination of the 
laterals has not been investigated. Research 
from Haskins Laboratories (O'Connor, Gerst- 
man, Liberman, Delattre, & Cooper, 1957) 
would indicate that adequate acoustic cues 
exist to differentiate among the phonemes in 
the set /w, j, r, 1/. Both second and third 
format transitions differ considerably between 
the patterns for /r/ and /w/, although /w/ is 
frequently substituted for /r/. 



The pattern of articulation results differs 
in some significant details from the discrimi- 
nation results. In articulation, the interdental 
position is much more difficult (41% errors) 
than either the labiodental (6%) or alveolar 
(16%) fricatives, Similar error rates were found 
by Bricker (1967): interdental, 72%, labio- 
dental, 37%; alveolar, 25%. His error rates 
were considerably higher, but the interdental 
position is clearly the most difficult. Templin 
(1957) used real words as stimuli and found 
error rates for /0/ and /s/ of around 75% over 
a comparable age range; /f/ and /s/ were 
easier (around 50%), but /v/ and /z/ were as 
difficult as /0/ and /s/ (around 75%), 

In view of the imperfect correspondence 
between discrimination and articulation, it 
may be useful to consider the different cues 
available to a child in the development of 
speech perception. In addition to the informa- 
tion in the speech of others, a child must rely 
on feedback from his own efforts at speech.^ 
Both acoustic and articulatory feedback must 
be taken into account. In producing a spoken 
word, the child hears what he produces and 
matches it against some target representation 
in memory, but he must also identify and retain 
the articulatory sequence used to generate the 
word, if learning is to occur, When develop- 
mental aspects of speech behavior are being 
considered, it is especially important to re- 
member that children may hear and produce 
phonological sequences very differently from 
the manner in which adults (particularly lin- 
guists) operate. It seems quite possible that 
much of the misarticulation observed in chil- 
dren's speech stems primarily from motoric 
difficulties — the motoric commands for cer- 
tain phonemes may be missing, and for other 
phonemes the child may use some approxima- 
tion that is sufficient for communication but 
differs substantially from the motor pattern 
used by adults. 

With regard to "what the child hears," 
the saliency of various cues may well change 
during acquisition of speech. As previously 
mentioned, duration of the friction and tran- 
sition components varies widely from one 
fricative to another, and the vowel duration 
is longer before voiced than unvoiced con- 
sonants. Spectrographic analysis of selected 
items from our discrimination test tape indi- 
cated that before a voiced fricative, the vowel 



^Note that visual cues may also be im- 
portant, particularly for phonemes with labial 
components. We are not aware of any sys- 
tematic work with children on this problem. 
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was longer by 50 to 100% than before an un- 
voiced fricative. The durational cue may ac- 
count for the relative ease with which voiced- 
unvoiced combinations in final position were 
discriminated by the preschoolers (cf, Table Ij 
except for /f-z/, VC for these combinations 
had a much lower error rate than CV) , 

It would be interesting to have more de- 
tailed information about the phonetic environ- 
ment in which children develop speech ability. 
If our interpretation of the data is correct, the 



children tested in this study were quite sen- 
sitive to temporal cues. It may be that, in 
speaking to their children, parents provide 
redundant phonetic cues on several dimen- 
sions in an exaggerated form, in this fashion 
facilitating the child's acquisition of speech. 
Further work with synthetic speech will be 
necessary for evaluation of the relative im- 
portance of these various cues in the develop 
ment of speech recognition. 
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APPENDIX 

STIMULUS PAIRS USED IN DISCRIMINATION TASK 



Set III Set IV Set V 



fa»<'‘ 




dal 


af* 


ip* 


va 


za 


at* 


da* 


iO 




af 


i£ 


va^'‘ 


zal* 


at 


da 


iO* 


az>;« 


ta 


va 


6a>:< 


af* 


as 


Vi 


Si* 


if* 


is 


val 


Sal* 


a0>:‘ 


az 


va 


za* 


10* 


iz 


Vi 


zi>i« 


alO* 


alz 


da 


i0’:< 


zal* 


at 


fa 


id* 


az* 


ta 


a '5 


k i)j{ 


af 


av>!‘ 


as>i‘ 


a0 


if 


iv* 


is* 


iO 


alf 


i% X * 

alv* 


©a’:* 


sa 


ats 


af* 


Si* 


si 


is 


if* 


Sal* 


sal 


af»!‘ 


aO 


za 


sa* 


if* 


iO 


zi 


si* 


alf* 


alO 


zal 


at>!‘ 


da»i‘ 


10 


az 


ta* 


aS* 


ki 


fa 


in)I« 


tSa>i- 


za 


va 


Oa* 


Si* 


zi 


vi 


Oi* 


Sal* 


— iilg...... 

zal 




sa 


az 


af* 


vi* 


si 


iz 


if* 


val* 


sal 


az 






ki 


as 


ki* 


fa>Jf 


_i3_ 


gaJ 


af* 


ats 


a0>i< 


tSa* 


0a 


is 


i0* 


Si* 


0i 


a IS 


alO * 


za>J‘ 


fa 


av 


as* 


zi* 


fi 


iv 


is^'f 


zal* 


fal 


av 


a0’!‘ 


ats* 


az 


iv 


i0* 


iS* 


iz 


alv 


alO * 


az 


as* 


fa* 


0a 


iz 


is* 


fi* 


0i 


alz 


als* 


ip>:« 


va 


az 


ta * 


da* 


i0 


gal 


af 






&'a>!‘ 


fa 


ats 


as>i‘ 


Si* 


fi 


Si 


U JU * 

si* 


Sal* 


va 

fal 


sa 


0 a* 


fa* 


va 


si 


0i* 


fi* 


vi 


sal 


Ual* 


av>:< 


az 


0a 


za^'‘ 


iv* 


iz 


Oi 


zi* 


alv* 


alz 


aS 


ki* 


fa* 


ig 


gal 


af* 


ip* 


va 


zal 


at* 


fa 


sa>i‘ 


av* 


aS 


fi 


si>i‘ 


iv»i‘ 


iS 


fal 


sal* 



Set VI 



az 


ta* 


a6 


ki* 


alf* 


als 


val 


zal* 


fa* 


id 


als* 


alO 


ais 


alf* 


zal 


sal* 


gal* 


af 


val 


Oal* 


alz 


alf* 


ip* 


va 


Sal* 


Oal 


alv 


als* 


alS* 


alz 


fal* 


0al 


zal 


at* 


Sal 


sal>;< 


fal* 


val 


Oal 


zal* 


da* 


iO 


alv* 


3l& 



Note, — Syllable on left was presented first, 
are underlined. 



Asterisk indicates test syllable. Control pairs 
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